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Abstract 

A Content Distribution Network (CDN) can be defined as an overlay 
system that replicates copies of contents at multiple points of a network, 
close to the final users, with the objective of improving data access. CDN 
technology is widely used for the distribution of large-sized contents, like 
in video streaming. In this paper we address the problem of finding the 
best server for each customer request in CDNs, in order to minimize the 
overall cost. We consider the problem as a transportation problem and a 
distributed algorithm is proposed to solve it. The algorithm is composed 
of two independent phases: a distributed heuristic finds an initial solution 
that may be later improved by a distributed transportation simplex algo- 
rithm. It is compared with the sequential version of the transportation 
simplex and with an auction-based distributed algorithm. Computational 
experiments carried out on a set of instances adapted from the literature 
revealed that our distributed approach has a performance similar to its 
sequential counterpart, in spite of not requiring global information about 
the contents requests. Moreover, the results also showed that the new 
method outperforms the based-auction distributed algorithm. 

1 Introduction 

A Content Distribution Network (CDN) can be defined as an overlay system 
that replicates copies of contents at multiple points of a network, close to the 
final users, with the objective of improving data access. Some problems need to 
be considered in order to implement and operate a CDN. At first, the servers 
need to be strategically located in the network, aiming to cover a wide range of 
potential clients. This problem is called the Server Placement Problem (SPP)[3]. 
Once servers are placed, and given a short-term forecast (say, for the next hours) 
of the clients requests for contents, it is necessary to decide how contents should 
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be replicated among the servers in order to minimize the expected access time. 
This second problem is called the Replica Placement Problem (RPP) [T7]. The 
last one, called the Request Routing System Problem (RRSP) consists in, given 
the client requests and the configuration of the CDN (server/link capacities, 
latency, costs, etc), finding the best assignment of each request to a server that 
holds a copy of the required content. Remark that, while it is usual that most 
requests will be really assigned to a single server (the strict meaning of the 
word "assignment" in the optimization community), the problem definition al- 
lows splitting the demand of a request on different servers. This work proposes 
viewing and solving the RRSP as a Transportation Problem (TP). The Trans- 
portation Problem (TP) is a classical optimization problem [T] that deals with 
sources where a supply of some items are available and destinations where the 
items are demanded. The objective is finding a minimum cost way of trans- 
porting the items. Given its vast range of applications, the TP has obviously 
been extensively studied. The traditional sequential approach can only tackle 
a distributed problem after all data is collected in a central node, then the so- 
lution is broadcast over the network. However, it becomes increasingly harder 
to use this sequential approach, as the volume of data to be collected over a 
large wide area network increases. The difficulty lies in maintaining the data 
updated in a highly dynamic environment. In fact, the exponential growth of 
the Internet over the last three decades could only be sustained because some 
of its key protocols were designed as distributed algorithms. 

In this work, we propose a distributed version of the Transportation Simplex, 
an algorithm that was devised by Dantzig as a specialization of the Simplex 
algorithm for general linear programming problems p^. Although the Trans- 
portation Simplex is still the most usual algorithm for the sequential TP, to the 
best of our knowledge, no effort was made to obtain solution for its distributed 
version, where the sources and destinations are nodes of an actual network and 
the problem data is distributed among them. 

In this paper we also compare the proposed algorithm with an auction-based 
algorithm. The auction algorithm is a parallel relaxation method for solving the 
classical assignment problem and has been generalized to solve linear transporta- 
tion problems in [S] . The algorithm operates like an auction, there is a price for 
each object, and at each iteration, unassigned persons bid simultaneously for 
objects thereby raising their prices. Objects are then awarded to the highest 
bidder. Although it has been proposed for shared memory multiprocessors, we 
adjusted it so that it also executed in distributed environments. 

The remaining of the paper is organized as follows. Section [2] provides an 
overview of related works that solve the transportation problem in parallel. In 
Section [3] we show the modelling of the RSSP as a transportation problem. 
In Section |4] the developed distributed auction algorithm is shown. The pro- 
posed distributed algorithms, both the one used to obtain an initial solution 
as the distributed transportation simplex, are presented in Section [5] Section 
[6] discusses the implementations and report the results of computational tests. 
Finally, Section [7] concludes the paper. 
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2 Related Work 



In the related literature, two well known approaches are presented to solve 
the transportation problem in parallel. The first one consists of parallel ver- 
sions of the network simplex algorithm. Those algorithm solves the minimum 
cost network flow problem, that includes the transportation problem as a par- 
ticular case. The second approach considers auction algorithms to solve linear 
transportation problems. 

In [7] , a primal network simplex algorithm, that applies decomposition of the 
original problem into subproblems, is proposed. Three variants of this parallel 
algorithm are presented. In the first one, each process executes the same algo- 
rithm over a different subproblem. Processes exchange parts of subproblems, 
periodically, in order to eliminate arcs that connect subproblems of different 
processes, called across arcs. In the second strategy, additionally to the pro- 
cedure proposed previously, some processes are dedicated uniquely to perform 
pricing operations. Finally, in the third method, a process can interrupt an- 
other one, to transfer a subproblem when an across arc appears in the solution. 
In all cases, the global optimal solution is found when all local problems are 
solved and there are no candidate across arc to enter the solution. Tests were 
performed in a shared memory multiprocessor. Almost linear speedups were 
obtained on instances corresponding to multi-period network problems. 

A parallel dual simplex algorithm for shared memory parallel computers is 
introduced in [15j . The traditional simplex (primal or dual) does not offer many 
possibilities for parallelism, because it moves from a feasible basic solution to 
another by performing one pivoting operation at a time. Then, Thulasiraman 
et al. proposed a dual simplex method that performs concurrent pivoting oper- 
ations, by executing each of them over small subgraphs. This operation consists 
of traversing the spanning tree of the graph, corresponding to a basis, from 
leaves to root, identifying clusters that can perform concurrent pivoting. Each 
process consists of a graph node and the communication between processes uses 
shared memory. Tests were run in the multiprocessor BBN Butterfly. Good 
speedups were observed only for a small number of processors, adding more 
processors did not decrease computational times. This indicates that the test 
instances only allowed a limited number of concurrent pivots. 

In llQl the authors presented a similar approach by introducing a syn- 
chronous parallel primal simplex algorithm for transportation problems on shared 
memory parallel computers. The method also performs concurrent pivoting op- 
erations by distributing the cost matrix across the machine with each processor 
storing a contiguous block of rows. Comparisons were made with parallel as- 
signment problem algorithms but there was no clear winner. 

Later, a parallel primal simplex method for minimum cost network flow 
problem is also proposed in [2] . A greater degree of parallelism is achieved by 
also breaking down the pricing (not only the pivots) into different processors. 
A monitor is used to synchronize parallel processes and to schedule tasks to 
processors. Those tasks can (i) select an arc and make the pivoting, (ii) update 
the dual variables, or (iii) perform the pricing operation. Tests were executed 
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on a shared memory multiprocessor, Sequent Symmetry S81 with 20 processors 
and 32 Mbytes of shared memory. For the instances tested, an average speedup 
of 8.26 was reached. 

A recent review of paraUel simplex methods is presented in [11) . According 
to it, there is no parallelization of the simplex method that offers significant 
improvement in performance for large and sparse LP problems. However, some 
implementations of parallel solvers applied to dense or specific classes of linear 
programs were successful. 

Concerning auction-based parallel algorithms, Bertsekas and Castanon [5] 
proposed the converting of the transportation problem into an assignment one, 
intending to use a previously introduced auction algorithm for the assignment 
problem with small modifications to exploit the special structure of the network. 
Later, they discussed about parallel implementations of the auction algorithm in 
shared memory multiprocessors with several degrees of synchronization among 
the processes for the assignment problem [B]. More recently, a particular case 
of the assignment problem, that deals with the lack of global information and 
limited communication capability, is solved by a distributed auction algorithm 
that maximizes the total assignment within a linear approximation of the opti- 
mal solution |16j . In [8] , a novel variation of the assignment problem is tackled 
by an auction algorithm that considers a hierarchy of decision makers. 



3 Modeling the RSSP as a Transportation Prob- 
lem 

Let G = (V, A) be a complete bipartite graph, where the vertex set V is 
divided into subsets Vi with n sources and V2 with m destinations. The arc («, j) 
from source i to destination j has a cost Cij . Each source i e Vi is associated with 
a capacity bi , while each destination j G V2 is associated with a demand dj . It is 
assumed that capacities and demands are balanced, i.e., J2ieVi ~ ^jeV2 ^J'^ 
if this is not the case, an artificial vertex can be introduced to absorb the excess 
of capacity or demand. The Transportation Problem (TP) consists in solving 
the following linear program; 

ieVi jeV2 

s.t. Y,x,,^d,, yjeV2, (2) 
ieVi 

^Xij=bi, yieVi, (3) 

x,,>0, yi£VujeV2, (4) 

where variables Xij represent the flow from i to j. Remark that any of the 
equalities in (2|3| is implied by the remaining m + n — 1 equalities and can be 
removed. 
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To cast the RRSP as a TP, some considerations are made about typical 
CDNs: 



• A CDN is composed by a set I of servers, spread around a large geograph- 
ical area, that can communicate using the underlying Internet structure. 

• A CDN hosts a set C of distinct contents. At a given moment, a server 
i Cz I has a subset C'i of contents, determined by the last solution of the 
Replica Placement Problem. Alternatively, it can be said that for each 
c G C, /c C / is the subset of servers that have c. It is usual that only 
a single central server (which is actually a datacenter) contains a copy of 
all contents. 

• The servers have a limited amount of outgoing bandwidth available. The 

bandwidth of server « G / is given by bi . 

• At a given moment, there is a set J of client requests. A request j & J 
is associated with a content c(j) G C and with a required bandwidth of 
dj. The value of dj is related to Quality of Service issues. Sometimes dj 
is defined as a function of the content c{j). J{Ci) C J is the subset of 
requests for a content in Cj. 

• There is a subset ii' C J of the servers (called client servers) that not only 
hold contents, they arc also associated to the client requests. In other 
words, each request j G J is associated with a server k{j) G K. When 
content c{j) belongs to Cfe(j), the request can possibly be attended by k{j) 
itself. Otherwise, it must be redirected to servers that contain c(j). In 
any case, the total consumption of bandwidth of the servers attending j 
is dj. 

• It is assumed w.l.o.g. that there exists at most one request in server i for 
a content c, so a request j & J can be alternatively identified by the pair 



• The CDN administrator can define the communication cost Ci^i^ between 
two servers i\ and i2 in /, per unit of traffic, based on parameters like 
distance or latency. If i\ =12, the cost is zero. This means that fully 
attending a demand j by fc(j), if it is possible, costs zero. But this would 
still consume the bandwidth of k{j) by dj units. 

Given the above considerations, the RRSP can be modeled as the following 
linear program: 




(5) 



s.t. 




(6) 
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^ x,j <h, yiel, (7) 

Xij > 0, Vj e J, i e (8) 

where variables the bandwidth that server i uses to attend demand j. 

By associating Vi to / and V2 to J, the relation between (l][4| and ([5]|^ 



becomes evident. However, in order to obtain a perfect reduction of the RRSP 
to a TP, some details must be fixed: 



The bipartite graph induced by (5][8) is not complete. The solution is 



adding the missing arcs, between requests and servers that do not contain 
the required content, with suitable large costs. We remark that even 
when many artificial arcs are introduced, this has a small impact on the 
performance of the distributed algorithms that will be proposed. This 
happens because most of those arcs can be handled implicitly and they 
are completely ignored as soon as a first truly feasible solution is found. 

Converting ([7| to equality can be easily done by adding an artificial de- 
mand of X^ie/ ^« ~ SjGj'^i uiiits. If this quantity is negative, then the 
problem is infeasible. Anyway, while adding an artificial demand is triv- 
ial for a sequential TP algorithm, it is not so obvious when designing a 
distributed algorithm, since it requires a global data that is not readily 
available in the beginning of the algorithm. 



4 The auction algorithm for the TP 

The auction algorithm is a parallel relaxation method for solving the classi- 
cal assignment problem. The algorithm operates like an auction whereby unas- 
signed persons bid simultaneously for objects by raising their prices. Each object 
j has a price price j, with the initial prices being zero. Prices of the objects are 
adjusted upwards as persons bid for the best objects, that is the object for which 
the corresponding benefit minus the price is maximal. Only persons without an 
object submit a bid, and objects are awarded to their highest bidder. When the 
prices of some of the assigned objects become high enough, unassigned objects 
become attractive and receive new bids. Thus, the algorithm executes several 
bidding and assignment steps towards a full assignment of persons to objects 
0- 

The transportation problem is converted into an assignment problem by cre- 
ating multiple copies of similar persons or objects for each source or destination, 
respectively. Two objects are called similar if every person to whom they can 
be assigned considers them as equally valuable, while two persons are named 
similar if they assign the same value to every object. The objective of the prob- 
lem is to maximize the costs of the flows between sources and destinations. The 
bidding and the assignment phases of the auction algorithm are highly paral- 
lelizable with shared memory. The bidding can be accomplished simultaneously 
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for all persons. Similarly, the assignment can be accomplished simultaneously 
for all objects. 

Considering that the servers of a CDN communicate through a network and 
do not have a shared memory, the algorithm by Bertsekas and Castaiion had to 
be adapted to execute in a distributed way. Here the sharing of prices and bids 
is accomplished through message exchanges among sources and destinations. 
The Algorithm [l] named AuctionTP, presents our proposal for a based-auction 
distributed algorithm for the TP. Each source accomplishes the bidding phase, 
by sending flows and bids to each destination not yet completed served by it, 
and by calculating new flows and bids according with the answers received from 
these destinations. Each destination receives all the sent bids, updates the 
flows and prices of its requests and sends an acknowledge message to all servers 
containing the new prices, flows and the corresponding sources responsible for 
serving it. The procedure of updating bids (12|, offering flows (13l, prices (17), 
accepted flows (17 1 and e (|9]and 18) are accomplished according with the rules 
proposed by [5]. The algorithm terminates with an optimal solution provided 
that the transportation problem is feasible and e < l/min{m,n}, where n and 
m are the number of sources and destinations, respectively. Remark that as 
RRSP consists of a minimization problem we multiplied each cost Cy by (— 1) 
and added the highest cost among all of them to each one. 



5 Distributed Algorithm 

5.1 Sequential Transportation Simplex 

The Transportation Simplex algorithm needs an initial basic solution to start 
with. The classical Northwest Corner method, described as Algorithm [2] is a 
constructive heuristic to obtain it. The method was initially devised as an easy 
to execute pencil-and-paper algorithm. However, its solutions can be crude 
as the method completely disregards the costs, the next variable Xij that will 
receive positive flow is arbitrarily chosen. The related Minimum Cost method 
usually obtains a better initial solution by choosing a variable Xij that still can 
be increased with minimum Cij . Anyway, the n + m — I basic variables found 
by those methods define a tree in the bipartite graph G. 

The Transportation Simplex itself [S] performs a sequence of iterations, 
where basic solutions (always defined by trees in G) can be improved until 
they are proved to be optimal. Each source is associated with a dual variable Ui 
and each destination with a dual variable Vj . The construction of a dual solu- 
tion uses the constraints Ui + Vj = Cij , for each basic variable Xij . One of those 
variables, say ui, can be fixed to zero, defining the root of the tree. Starting 
from that root, the remaining dual variables are determined in a unique way by 
propagating the values over the tree. After that, the reduced costs of the non- 
basic variables can also be calculated 3jS ^ij — Dip- If all reduced costs 
are non-negative, the solution is optimal. Otherwise, a variable with negative 
reduced cost is selected to enter the basis. This variable creates a cycle in the 
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tree. Increasing the value of the entering variable by 9 changes the values of the 
other variables along the cycle, by —9 and 9, alternating. The entering variable 
is increased by the maximum possible value, which causes some variables to drop 
to zero. One of those variables is selected to leave the basis, which corresponds 
to another tree. A complete description of all those sequential algorithms can 
be found in [3]. 

5.2 Distributed Approach 

In the distributed algorithms described next, each server in / is associated 
to a process, that executes the same local algorithm, which consists of send- 
ing messages, waiting for incoming messages and processing. Messages can be 
transmitted independently and arrive after an unpredictable but finite delay, 
without error and in sequence. Processes have distinct identities that can be 
totally ordered. Initially, a server i only knows the requests (i,c), c € C^. On 
the other hand, it is assumed that a server knows which contents are present in 
each other server (possibly using a service similar to DNS) and the correspond- 
ing communication cost. The assumption is reasonable since content position 
change much less dynamically than request data. 

5.2.1 Obtaining an Initial Solution 

Distlnit is a distributed constructive heuristic, inspired by the Minimum 
Cost method, but not necessarily equivalent to it. In Distlnit, at first, each 
server tries to attend its local requests. If it is not capable of serving a local 
request, a message Serve is sent to the closest server that stores that content. 
Upon receiving this message, a server answers with an ACKoi a CivT message 
depending on its bandwidth availability. When a server receives a NACK oi an 
ACK with partial attending, it sends another Serve message to the next closest 
server with the remaining required bandwidth. When all requests are attended, 
the algorithm finishes. See Figure[T]for an example of an initial solution. Squares 
and circles represent servers and requests, respectively. The number inside the 
square is the server identification i, while the two numbers inside the circle of 
a request j represent (fc(j), c{j)). The edges indicate which servers attend each 
request. 

5.2.2 Distributed Transportation Simplex 

The distributed transportation simplex (DistTS) initiates in the server that 
has the smallest identification. It assigns zero to its dual variable ui, calculates 
the dual variables v corresponding to the requests supplied by it, and sends 
these values to the servers that hold those requests, through Varv messages. 
Upon receiving this message, each process calculates the dual variables of the 
other servers that have also contributed to attend a common request and sends 
these results through Varu messages. This procedure is repeated, alternating 
the sending of Varv and Varu messages until all dual variables are calculated, 
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Figure 1: Initial Solution 



and consequently the reduced costs. Now, each server selects its most negative 
reduced cost edge and sends a message to the corresponding server, to introduce 
that non-basic variable in the solution. Then, messages are sent along the 
candidate cycles to select a basic variable to leave the solution. In the best case, 
this distributed algorithm can process all those cycles (i.e. make pivot steps) in 
parallel. However, those candidate cycles may have intersections, which would 
cause inconsistencies if they are pivoted at the same time. So, when a conflict 
is detected, the cycle associated with the smallest reduced cost is chosen to 
continue, while the other is canceled. For every pivot that is performed, it is also 
necessary to re-calculate the dual variables. However, instead of recalculating 
all of them, only the ones that belong to the new subtree are updated. The 
new dual variables are propagated through Varu and Varv messages and all 
procedures described above are repeated until it is detected that there is no 
edge with negative reduced cost. 

The steps of the Distributed Transportation Simplex can be summarized as 
follows. Those steps are illustrated by Figures 2 to 5, where an example of 
DistTS execution is presented. 

• Build the initial dual solution tree 

In this step, ui is updated with zero, the corresponding dual variables v 
are calculated and sent in Varv messages. Upon receiving these messages, 
other processes also calculate their dual variables and send these results 
through Varu messages, as shown in Figure [2|^a). When all dual variables 
are calculated, the initial dual solution tree is built, see Figure ^h), with 
the corresponding dual variables and reduced costs, as presented in Figures 
^c) andgd). 

• Create cycles 



9 



In this step, each server selects its most negative reduced cost edge and 
sends the message Reduced-Cost to the corresponding server. This pro- 
cedure produces candidate cycles, as shown in Figure [sj^a) , where three 
negative costs are selected (cg i,cl ^ ^3 3)- 

• Select a basic variable to leave the solution 

Upon receiving a Reduced_cost message, the server sends the Cycle message 
along the candidate cycle to select a basic variable to leave the solution, 
as seen in Figure [sj^b) with reduced cost C3 3. The Cycle message is sent 
with the value 9 to select the edge flow with the smallest value. Initially, 
9 is infinity. The value —9 or +9 is assigned to each edge of the cycle, 
alternately. When all edges of the cycle are visited, the smallest flow of 
the edges is known. Then, an Update message is sent along the cycle to 
update the flows of 9 with this smallest value. A basic variable that now 
has value zero is selected to leave the solution. It is possible to select 
until n variables to leave the solution. However, when these candidate 
cycles have intersections and they are pivoted at the same time, a mutual 
exclusion problem appears what can produce inconsistent results. 

• Cancel cycles 

When a conflict is detected, the cycle associated with the smallest reduced 
cost is chosen to continue, while the other is canceled. Suppose that the 
messages Reduced-Cost corresponding to the requests 3,1 and 3,3 arrive in 
the server 1 with equal costs, and then the other Reduced^cost message 
from request 3,2 with a smaller cost reaches it. In this moment, the server 
detects a conflict, as shown in Figurejsj^c). Then, Cance/ messages are sent 
along the cycle with the highest reduced cost, see Figure [Sjd). Only after 
canceling the previous cycle, a new basic variable selection is initiated (see 
Figure [4]^ a)) and the flows are updated as in Figure |4](b) . 

• Rebuild the dual solution tree 

The new tree after the pivot is rebuilt, as shown in Figure [sja), and them 
the new dual variables are propagated through Varu and Varv messages, 
as presented in Figure [5jb). 

All procedures described before are repeated until it is detected that there 
is no edge with negative reduced cost. 

5.2.3 Complexity Analysis 

At first we analyse the Distlnit algorithm that aims to get an initial solu- 
tion for the TP problem. Each one of the client servers tries to attend its local 
requests. In a worst case scenario, this procedure can propagate n messages for 
each local request m, resulting in worst case complexities of 0{n.m) messages 
(total number of messages sent during the algorithm), 0{n) global time (max- 
imum message sequence), and 0(|Ji|) local time, where \Ji\ is defined as the 
number of local requests for each server i G /. 
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Starting from a solution provided by the Distlnit method, the DistTS initi- 
ates the procedure that calculates all dual variables by sending Varv and Varu 
messages, resulting in a complexity of 0{n.m) messages. Next, each server se- 
lects its most negative reduced cost edge and warns the elected corresponding 
server, which in turn broadcasts its election to all other servers. In the worst 
case, O(n^) messages are sent to identify candidate cycles to be processed as 
pivot steps. The conflicting cycles are canceled with a complexity of O(n^) mes- 
sages while each one of the k remaining cycles (fc < n) are processed and has to 
re-calculate its dual variables. The new dual variables are propagated through 
all servers with a total of 0{k.n) messages. The above observations leads to a 
worst case message complexity for the DistTS method of 0{p.n.m), where p is 
the number of rounds of the Simplex. 

Considering now the global time of DistTS, we note that the maximum mes- 
sage sequence to process the dual variables, identify /cancel cycles and broadcast 
new variables, are all bounded by the height of the dual solution tree, which is 
0{n). Therefore, we reach a global time complexity of 0{p.n). Moreover, the 
local time spent by a server is dominated by the process of selecting its most 
negative reduced cost edge (0(n)), finding candidate cycles (0(n)), or selecting 
a request to be served {0{m)). Since 0{m) > 0{n), we achieve a local time 
complexity of 0{m). 

6 Experimental Results 

The algorithms described in the previous section were implemented in ANSI 
C and MPI (MPICII2) [T3] for message passing. All experiments were performed 
(with exclusive access) on a cluster with 42 nodes, each one with two processors 
Intel Xeon QuadCore 2.66Hz and 16Gb of RAM under Linux (Red Hat) 5.3 
operating system. The algorithms were tested over the set of instances presented 
in [M] for the combined RPP and RRSP problems (both problems are optimized 
at once). These instances were adapted for the RRSP by fixing the locations 
of the contents on the servers. These instances are divided into three classes: 
easy, medium and hard. The instances from the easy class are not considered 
in this work. There are twenty instances for each classes, five for each value 
of n e {10,20,30,50}. The average number of requests per server is 70. The 
entire benchmark is available for download in |12j . 

6.1 Comparison of DistTS and the Sequential Version 

The first experiment, reported in Table [T] is a comparison of the new dis- 
tributed heuristic Distlnit with the sequential Minimum Cost Method (MCM), 
in terms of solution quality. The instances are identified by the label n_y, in- 
stances with y ranging from 6 to 10 belong to class medium, those where y is 
between 11 and 15 are from class hard. The second and third columns present 
the number of requests and the value of an optimal solution, respectively. The 
next two columns are the value of the solution found by MCM and the corre- 
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sponding percentual difference to the optimal. The next columns are the average 
value of the solutions found by 10 executions of Distlnit (since this algorithm 
is not deterministic), the corresponding average difference to the optimal and 
the standard deviation. Distlnit performed a little better than MCM. Some 
very large solutions values indicate the use of some artificial variables, so those 
solutions are not really feasible. MCM could not find a feasible solution on 
17 instances, this happened 14 times with Distlnit. Considering only the 23 
instances where both algorithms could find a feasible solution, the average dif- 
ference to the optimal is 2.4% for MCM and 2.2% for Distlnit. The small values 
in the last colummn indicate that Distlnit found very similar solutions on its 
different executions of the same instance. However, it should be stressed that 
the tests were performed in a "well-behaved" environment. Processor loads and 
message delays would be much more unpredictable in a real CDN environment. 

The second experiment, reported in Table [2] compare the sequential Trans- 
portation Simplex method, initialized with the MCM, with the distributed 
Transportation Simplex method {DistTS), initialized with Distlnit. First, there 
is a comparison of their performance on finding the first truly feasible solution, 
without using artificial variables. In some practical contexts, one just needs a 
feasible solution, so the methods can be stopped at that point. The table shows, 
for each method, the value of the first solution and the number of pivot steps 
necessary to find it. If the initial heuristic already produces a feasible solution, 
the number of pivots is zero. The number of steps necessary to find the optimal 
solution is also given. It can be seen that the performance of both methods 
is similar in terms of pivots required to find an optimal solution. This means 
that the fact that some pivots are being made in parallel is not affecting the 
algorithm. However, the level of parallelism observed in this experiment was 
small, it is rare that more than 3 pivots can be performed in parallel. In those 
instances, the candidate cycles are composed of too many servers and requests, 
which results in lots of intersections and, consequently, poor parallelism. Its 
due to the characteristics of the instances, which were randomly generated in 
such a way that the requests for a given content are evenly distributed among 
the servers. Therefore, that are no natural clusters of requests/contents that 
can be optimized in parallel. 



6.2 Comparison of DistTS and AuctionTP 

The third experiment, reported in Table [3] compares the DistTS and Auc- 
tionTP methods. The first and second columns present the name and the op- 
timal solution of the instances. The next four columns contain the first fea- 
sible solution, the time in seconds to find it, the total time and the number 
of exchanged messages by the DistTS. The last four columns show the same 
information for the AuctionTP. 

It is possible to observe that for instances with 10 servers both algorithms 
present similar execution time. As the number of servers increases, the Auc- 
tionTP execution time also grows. It occurs because the bidding and assignment 
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steps, executed several times until the optimal solution is found, are separated 
logically by synchronization points. In the distributed algorithm it means that 
a server can not progress to the next bidding step without receiving an acknowl- 
edge message from each other destination to which it sent a bidding message. On 
its turns, without receiving all bidding messages, destination processes do not 
update their prices and send the corresponding acknowledge messages. Clearly, 
the time spent with synchronization increases with the number of servers. 



7 Final Remarks 

This paper tackled a known problem of CDNs, the Request Routing System 
Problem, as a Transportation Problem. In the related literature, simplex and 
auction based approaches are usually presented to solve the TP in shared mem- 
ory parallel machines. However, as CDN information is spread out on servers 
that communicate only by message passing, new parallel algorithms adjusted to 
this distributed scenario have been proposed here. 

Our main contribution was the design and implementation of a distributed 
transportation Simplex algorithm adapted to solve the RRSP. It includes a 
distributed heuristic for finding an initial solution that it is interesting on its 
own. Furthermore, for comparison purposes, we also developed a distributed 
Auction algorithm based on the shared memory parallel version of [S] . 

Experiments with instances adapted from the literature were performed. The 
results pointed that the Distlnit and DistTS algorithms have a performance 
similar to their sequential counterparts, in spite of not requiring global informa- 
tion about the content requests. However, only a limited amount of parallelism 
was obtained in the tested instances. This was majorly due to the fact that in 
tested instances there were few natural clusters of requests/contents that could 
be processed in parallel. Moreover, we also verified that DistTS outperformed 
the AuctionTP in the largest instances, showing that the synchronization points 
of AuctionTP introduced a high overhead in a distributed environment. Future 
work will concentrate on the investigation of the parameter e in order to improve 
the convergence time of AuctionTP. 
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Algorithm 1: Auction Algorithm for the TP 

1 Initialization 

2 Initial Assignment: flowaj ■= dj, Vj € J, where a is an artificial 

source; 

3 for every source i that offers a content required by destination j do 

4 bidij := 0; 

5 offer_flow,j := 0; 

6 priccij := 0; 

7 flowij := 0; 

8 end 

9 £ := {maxi^ij^j{cij} x min{n,m})/2; 

10 Bidding Step executed by source i 

11 for each destination j not totally attended by source i that requires a 
content in Ci do 

12 update bidij {flowij, priccij, Cij, s); 

13 update of fer-flowij {flowij, priccij, Cy); 

14 send messa,ge{bidij , offer^flowij, i) to destination j; 

15 end 

16 Upon receiving message {ACK, locaLprice, local-flow, local-server) 
from destination j: 

17 update pricBkj and flowkj, Vfc e local server; 

18 update e; 

19 Assignment Step executed by destination j 

20 Upon receiving message {bidij, of fer^flowij, i) from every source i 
that keeps a required content: 

21 list := received messages sorted by decreasing order of bid; 

22 local-price := 0; 

23 local-flow := 0; 

24 locaLserver := 0; 

25 while (not full attended) or (list do 

26 price- flow jreceivedi := first element removed from list; 

27 local -price := local-price U 6idjj from price-flow -received^; 

28 local-flow := local-flow U updated offer-flowij from 
price _/?ow; -receitiedi ; 

29 locaLserver := locaLserver U i from price-flow jreceivedi; 

30 end 

31 send (ACJC, locaLprice, local-flow, locaLserver) to every source k € I; 
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Algorithm 2: Northwest Corner method 



A = mm{6;..rf;.}; 



Input: Integers n and m, capacity and demand vectors b and d 
Output: Flow matrix x, boolean matrix B identifying the basic variables 

1 X = 0, B = false, s' = s, d' = d; 

2 i = 1, j = 1; 

3 repeat 

4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 



Xij = A, B,j = true, b'^ = b'^ - A, d'j = d'j 
if d' 7^ and 6J = and dj = then 
I BiQ+i) = true; 
end 

if 6^ = then 
I i = i + l; 
end 

if d'j = then 

I i = j + 

end 



15 until d' = 0; 
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(c) (d) 

Figure 2: (a) Varv and Varu Messages; (b) Dual Solution Tree; (c) Tree with 
Dual Variables; (d) Tree with Reduced Costs 
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(a) (b) 
Figiire 4: (a) Select other cycle; (b) Updating the flows 
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Table 1: Heuristic solution: Distlnit x MCM 
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0.0 


0.0 


10-8 


641 
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380 743 


7 
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0.3 


0.0 
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461.161 
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77 992 9 
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57.5 


10_10 


649 


501.082 




3.1 


516,538 


3.1 


1.1 


10-11 
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471 763 


471 763 





471763 


0.0 
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10-12 
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316 065 






93,844,995 


29 591.7 


7.6 


10-13 


641 


378,231 




0.7 


379,416 


0.3 


0.0 


10-14 


659 


461.161 


360,134,315 


77,992.9 


104,635,498 


22.589.6 


54.3 


10-15 


649 


501,082 


516,506 


3.1 


519,473 


3.7 


0.8 


20-6 


1,289 


1,434,570 


298,622,394 


20,716.2 


69,422,807 


4,739.3 


106.5 


20-7 


1,356 


916,306 


974,232 


6.3 


950,868 


3.8 


0.6 


20-8 


1,314 


1,158,373 


252,787,670 


21,722.6 


186,742,182 


16,021.1 


34.6 


20-9 


1,352 


1,160,991 


814,687,624 


70,071.7 


542,353,658 


46,614.7 


18.8 


20-10 


1,367 


853,256 


864,902 


1.4 


864,245 


1.3 


0.1 


20-11 


1,289 


1,434,570 


298,622,394 


20,716.2 


90,560,889 


6,212.8 


80.4 


20-12 


1,356 


916,306 


974,232 


6.3 


951,267 


3.8 


0.7 


20-13 


1,314 


1,158,373 


252,787,670 


21,722.6 


181,488,041 


15,567.5 


19.7 


20-14 


1,352 


1,160,991 


814,687,624 


70,071.7 


481,505,793 


41,373.7 


22.6 


20-15 


1,367 


839,076 


845,561 


0.8 


845,503 


0.8 


0.2 


30-6 


2,007 


1,443,887 


1,452,805 


0.6 


1,471,337 


1.9 


0.3 


30-7 


1,963 


1,280,967 


1,299,447 


1.4 


1,311,361 


2.4 


0.2 


30-8 


2,021 


1,132,309 


1,187,076 


4.8 


1,168,246 


3.2 


1.6 


30-9 


1,991 


1,169,035 


1,219,235 


4.3 


1,226,495 


4.9 


0.4 


30-10 


1,998 


1,150,971 


111,066,244 


9,549.8 


87,874,822 


7,534.8 


50.7 


30-11 


2,007 


1,458,771 


1,467,689 


0.6 


1,487,350 


2.0 


0.2 


30-12 


1,963 


1,280,967 


1,299,447 


1.4 


1,308,424 


2.1 


0.4 


30-13 


2,021 


1,132,309 


1,187,076 


4.8 


1,168,002 


3.2 


1.5 


30-14 


1,991 


1,182,625 


1,232,757 


4.2 


1,241,409 


5.0 


0.3 


30-15 


1,998 


1,150,971 


111,066,244 


9,549.8 


68,044,774 


5,811.9 


113.0 


50-6 


3,391 


2,223,512 


232,991,437 


10,378.5 


2,326,645 


4.6 


0.9 


50-7 


3,329 


1,655,212 


1,674,253 


1.2 


1,670,467 


0.9 


0.2 


50-8 


3,214 


3,065,126 


81,444,220 


2,557.1 


100,397,235 


3,175.5 


68.0 


50-9 


3,303 


3,029,901 


3,111,283 


2.7 


3,117,665 


2.9 


0.4 


50-10 


3,295 


2,196,471 


2,250,427 


2.5 


2,227,284 


1.4 


0.5 


50-11 


3,391 


2,223,512 


232,991,437 


10,378.5 


2,333,775 


5.0 


0.7 


50-12 


3,329 


1,655,212 


1,674,253 


1.2 


1,668,489 


0.8 


0.2 


50-13 


3,314 


3,065,126 


81,444,220 


2,557.1 


162,122,968 


5,189.3 


35.3 


50-14 


3,303 


3,011,459 


163,208,074 


5,319.6 


14,219,738 


372.2 


126.4 


50-15 


3,295 


2,196,403 


2,223,243 


1.2 


2,217,503 


1.0 


0.2 
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Table 2: Transportation Simplex: Sequential x Distributed 
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4 


22 
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10 
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21 


20 6 


1,465,556 


17 


52 


1,555,328 


5 


46 
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974,232 





86 
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77 
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1,165,685 


12 


26 


1,385,258 


12 


32 


20 _9 


1,162,230 


57 


74 


1,170,782 


43 


86 
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864,902 





17 


866,121 





21 
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1,465,556 


17 


52 


1,469,270 


6 


45 


20_12 


974,232 





86 


937,778 





79 
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1,165,685 


12 


26 




11 


35 


20_14 


1,162,230 


57 


74 


1,169,135 


43 


81 
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845,561 





12 
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12 


30_6 
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33 
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42 
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36 
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49 
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63 
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60 
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9 
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8 
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25 
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7 
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12 
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27 
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3,149,847 


7 
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13 
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50-14 
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11 
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1 
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Table 3: DistTS x AuctionTP 
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1,555,328 


0.13 
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16.20 
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938,719 


0.02 


0.79 
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85.75 


167.44 
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20-8 


1,158,373 


1,385,258 


0.23 


0.44 


58,628 


1,158,373 


15.71 


233.78 


6,177,400 


20-9 


1,160,991 


1,170,782 


0.68 


1.09 


141,779 


1,160,991 


10.82 


195.10 


6,275,040 


20-10 


853,256 


866,121 


0.01 


0.32 


41,636 


853,256 


2.83 


164.35 


5,299,200 


20-11 


1,434,570 


1,469,270 


0.13 


0.71 


94,914 


1,434,570 


16.81 


171.44 


5,617,416 


20-12 


916,306 


937,778 


0.01 


0.65 


86,582 
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86.18 


168.40 


5,497,504 
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0.01 


0.19 
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2.90 


165.70 
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1,443,887 


1,471,605 


0.08 


12.71 
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1,166,714 


0.13 
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206,759 
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8.00 
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28,915,614 


30-9 
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1,222,032 


0.05 


24.74 


272,909 


1,169,035 


7.73 


502.68 


29,040,348 


30-10 


1,150,971 


1,175,020 


2.72 


13.44 


181,513 


1,150,971 


47.75 


498.32 


29,718,000 


30-11 


1,458,771 


1,486,846 


0.07 


11.85 


178,703 
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25.81 


535.75 


31,692,180 


30-12 
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1,299,569 


0.34 


12.32 
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13.41 
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28,547,466 


30-13 
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1,169,751 


0.23 


20.31 


214,672 
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7.95 


485.60 


28,909,776 


30-14 
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1,235,904 


0.07 


26.18 
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7.62 
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29,034,486 


30-15 


1,150,971 


1,171,555 


2.52 


12.48 


164,366 
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47.95 


496.07 


29,730,000 
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1.13 


165.27 


1,318,262 
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1,712.05 
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1,666,277 


1.06 


34.85 
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1,520.94 
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6.90 
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1,580.04 
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3,123,216 


1.05 


125.19 


949,613 
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135.05 
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257,829,200 


50-10 
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2,236,384 


0.90 


57.01 


455,113 


2,196,471 


164.02 
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245,177,400 


50-11 
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2,308,857 


1.08 


160.70 


1,298,531 


2,223,512 


157.29 


1,693.02 


251,864,000 


50-12 
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1,664,698 


1.07 


22.85 


165,867 
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1,522.36 
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50-13 
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21.02 
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1,427,140 
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1,549.89 


240,678,250 
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1.99 
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1,001,733 
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1.04 


49.35 


398,012 


2,196,403 


162.15 


1,626.70 


251,892,500 


Average 


1,299,608 


1,342,931 


1.15 


30.37 


285,878 


1,299,608 


40.55 


573.43 


71,242,506 
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